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1. Introduction 


Wind power is one of the most rapidly growing renewable 
energy sources, and is regarded as an appealing alternative to 
conventional power generated from fossil fuel. This led to a 
collaborative effort to achieve 20% of U.S. electricity supplied from 
wind power by 2030 [1]. Although the integration of wind power 
brings many advantages, high penetration of wind power provides 
a number of challenges in power system operations and planning, 
mainly due to its uncertain and intermittent nature. In the 
electricity system the power supply must be equal to the power 
demand at all times. However, the variation of wind power output 
makes it difficult to maintain this balance. 

One of the possible solutions to the balance challenge is to 
improve the wind speed and power forecasting. Research in the 
area of forecasting wind speed or the power produced by wind 
farms has been devoted to the development of effective and 
reliable tools and many different approaches have been proposed 
and reviewed in [2-13]. Accurate forecasting tools reduce operat- 
ing costs and improve reliability associated with the integration of 
wind power into the existing electricity supply system [14-29]. 

There are different users of wind speed and power forecasts. These 
users not only need point forecasts but also the uncertainty of the 
forecast is essential for determining the size of the operating reserves 
necessary to balance the generation with load. For the market 
operator, the forecast of the total wind power production in a region 
is needed more than the individual wind plant forecast. For the system 
planner, it is essential to plan the installation of new wind turbines. 
The desired forecasting capabilities typically vary among user groups. 
This paper divides the forecasting methods into five categories: wind 
speed and power forecasting, spatial correlation forecasting, regional 
forecasting, probabilistic forecasting, and offshore forecasting. 

A large amount of research has been directed toward the 
development of accurate and reliable wind speed and power 
forecasting models and many different approaches have been 
developed. However it is difficult to draw conclusions as to which 
model is the best because each model in use has significant site 
dependencies. Thus, a forecasting model may perform well at its 
site, but this does not guarantee that the model will work well at 
another site. This paper discusses forecasting accuracy based on 
variable factors that are used in the forecast. 

Furthermore, several potential techniques for improving fore- 
casting accuracy have been reported in the literature. These are 
also discussed and reviewed in this paper. 

The paper is organized as follows. Section 2 presents an overview 
of existing wind speed and power forecasting approaches. In Section 
3, the general forecasting accuracy is discussed. Some of the potential 
techniques to improve the performance of forecasting models are 


presented in Section 4. Finally, conclusions and future works are 
drawn in Section 5. 


2. Overview of current wind speed and power forecasting 
2.1. Wind speed and power forecasting 


The basic role of wind speed and power forecasting is to 
provide information about the wind speed and power that can 
be expected in the next few minutes, hours, or days. Based on 
power system operation requirements, the forecast can be divided 
into four different horizons: very short-term (few seconds to 
30 min), short-term (30 min to 6h), medium-term (6-24 h), and 
long-term (1-7 days) [10,11]. Very short-term forecasts are used 
for turbine control and load tracking. Short-term forecasts are 
utilized for preload sharing. Medium-term forecasts are used for 
power system management and energy trading. Long-term fore- 
cast are used for maintenance scheduling of the wind turbines. 

Research in the area of forecasting wind speed and the power 
produced by wind farms has been devoted to the development of 
effective and reliable tools and many different approaches have 
been proposed. These tools can be classified whether the terrain 
information at the location is used as an input or not. Two 
mainstream approaches are the physical and the statistical 
approach. In some models a combination approach is used in an 
attempt to integrate the advantages of both approaches. In this 
section an overview of existing wind speed and power forecast 
approaches is presented. 


2.1.1. Physical forecasting approach 

The physical approach to forecasting, in contrast to statistical 
approach, uses the detailed physical description to model the on- 
site conditions at the location of the wind farm [7,30]. The basic 
operation of a physical approach is illustrated in Fig. 1. 

It carries out the refinement of the Numerical Weather Predic- 
tion (NWP) data to take into account the on-site conditions by the 
downscaling method, which are based on the physics of the lower 
atmospheric boundary layer. The downscaling method requires 
the detailed physical descriptions of the wind farms and their 
surroundings, including: description of the wind farm (wind farm 
layout and wind turbine power curve, etc.) and description of the 
terrain (orography, roughness, obstacles, etc.). Then, the refined 
wind speed data at the hub height of the wind turbines is plugged 
into the corresponding wind power curve to calculate the wind 
power production. If the on-line data is available, model output 
statistics are performed to reduce the error of the forecast. 
Contrary to the statistical approach, the physical approach does 
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Fig. 1. The physical approach to forecasting wind speed and power. 


not require training input from historical data. However, acquiring 
the physical data is one of the main drawbacks of the approach. 

A number of physical approaches have been introduced 
[2,7,8,31,32]. The Prediktor is developed by the Risoe National 
Laboratory in Denmark. It uses Wind Atlas Analysis and Applica- 
tion Program (WAsP) and PARK program to take the local condi- 
tions into account by using the NWP forecast from High Resolution 
Limited Area Model (HIRLAM) [33-35]. The Previento, developed 
by the University of Oldenburg in Germany has a similar physical 
approach but uses a different NWP forecast from Lakelmodell of 
the German Weather Service [36]. The LocalPred is developed by 
CENER - National Renewable Energy Centre in Spain. It involves 
adaptive optimization of the NWP forecast, time series modeling, 
meso-scale modeling with MM5, and power curve modeling [37]. 
The eWind, developed by AWS TrueWind Inc. in the USA, has a 
similar physical approach with Prediktor but uses a high-resolution 
boundary layer model (ForeWind) as a numerical weather model 
to take the local conditions into account [38]. 

The physical approaches are based on the models using the 
fundamental physical principles for conservation of mass, momentum, 
and energy in air flows. These models address computational fluid 
dynamics (CFD) for simulating the atmosphere. Although there are 
many CFD models available, they are all based on the same basic 
physical principles. They differ in how the grids are structured and 
scaled, and how the numerical computations are performed. 

In the majority of cases the statistical approaches provide good 
results in short-term, medium-term, and long-term forecasting. 
However, in the very short-term and short-term horizon, the influ- 
ence of atmospheric dynamics becomes more important, and in these 
cases the use of physical approaches becomes essential. 


2.1.2. Statistical forecasting approach 
The alternative main approach for wind speed and power fore- 
casting is based on statistical modeling. The statistical approach 


represents the relation between wind power or speed forecasting 
and explanatory variables including NWPs and on-line measured data 
[7]. The general form of the model is illustrated in Fig. 2. 

The statistical approach generally uses previous history data to 
build the statistical model. This model uses NWP forecast for time 
t+k and on-line measurement at time t to forecast the present 
over the next few hours. It is easy to model and inexpensive. 
However, contrary to the physical approach, the statistical 
approach requires historical data to train the statistical model. 
Many different approaches are employed [2,3,5,6,11,13]. Some of 
the representative statistical approaches are reviewed in this 
section. 


2.1.2.1. Conventional statistical approach. In the conventional 
statistical approach a time series model is applied to forecast 
future wind power or speed. According to the forecasting process, 
which was proposed by Box-Jenkins, this model is divided into 
four main steps to make a mathematical model of the problem 
including model identification, model estimation, model 
diagnostics checking, and forecasting. Several types of time 
series models may be considered, including autoregressive 
model (AR), moving average model (MA), autoregressive moving 
average model (ARMA), and autoregressive integrated moving 
average model (ARIMA). The general form of the model is 


Pp q 
Xt=C+er+ DY piXt-it DY Ger-i (1) 
t= i=1 


where X; represents the forecasting parameter at time t, g; is the 
autoregressive parameter, 6; is the moving average parameter, c is 
the constant, and random variable e; is the white noise. This model 
represents the ARMA model having the autoregressive model of 
order p and the moving average model of order q (ARMA(p,q)). If 
the order of the moving average model (q) is zero, it represents the 
autoregressive model of order p (AR(p)). If the order of the 
autoregressive model (p) is zero, it represents the moving 
average model of order q (MA(q)). The ARIMA model is a 
generalization of an ARMA model. 

The ARMA model is used for wind power forecasting in U.S. 
wind farms in [39,40]. Some of the models offer improvements 
over the persistence model but some are not by changing the 
order of AR and MA. The performance of the model is highly 
dependent on the parameters of the model. 

The Improved Time Series Method (ITSM) based on ARIMA is 
used in [41]. The sub-wind speed series obtained from the wavelet 
decomposition is forecasted and then the aggregate calculation of 
the sub-series is performed to obtain the final wind speed forecast. 
The simulation results indicate that the proposed method 
improves the accuracy of forecasting over the classical time series 
model and the neural network model. 

The Autoregressive Conditional Heteroscedastic (ARCH) model 
is combined with the ARIMA model to consider the heteroscedas- 
ticity effect of the residual series [42]. The ARIMA-ARCH model is 
used to forecast the sub-wind speed series obtained from the 
wavelet decomposition. The final wind speed forecasting is the 
sum of these forecast values. The results show that it can improve 
the accuracy of the forecasting. In [43], the ARIMA-ARCH model is 
used to forecast wind speed itself. It is shown that the ARIMA- 
ARCH model shows better performance than the ARIMA model. 

To deal with the non-stationary of wind speed, wind speed 
data are transformed to a Gaussian distribution and standardized 
to remove the non-stationary in [44]. The parametric AR model is 
then applied to the hourly wind speed data and is updated during 
the real-time operation by a recursive least squares algorithm. The 
method is validated by using the wind speed data from a wind 
power site. 
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Fig. 2. The statistical approach to forecasting wind speed and power. 


In [45], the ARMA model is used to forecast the tuple of wind 
speed and direction. Four different approaches are proposed and 
their performances are compared; component, traditional-linked 
ARMA, vector AR (VAR), and restricted VAR approaches. The 
component approach is better than traditional-linked ARMA for 
wind direction forecasting but not for wind speed forecasting. The 
VAR approach offers better performance in wind direction and 
close performance in wind speed comparing with traditional- 
linked ARMA. There is little difference in terms of forecasting 
performance between VAR and restricted VAR approaches. 

The AR model using a Bayesian approach is used to forecast the 
wind speed in [46]. First, Box-Cox transformation is performed to 
correct the non-normality of the wind speed. Then, the Markov 
Chain Monte Carlo (MCMC) simulation is performed to estimate 
the AR model parameters, which can then be used to forecast wind 
speeds. It shows the great potential and possible improvements of 
a Bayesian framework. 

In summary, conventional statistical approaches are based on 
classical linear statistical models such as AR, MA, ARMA, and the 
Box-Jenkins approach, where Box-Jenkins is based on ARIMA or 
seasonally adjusted ARIMA models, also known as SARIMA mod- 
els. However, the references show that the forecasting accuracy 
can be improved depending on the model parameters that are 
selected. 

The conventional statistical approaches are mostly aimed at 
very short-term and short-term forecasting. These models are easy 
to formulate and are capable of providing timely forecasts. As 
forecasting accuracy is improved, conventional statistical 
approaches are often used as a reference model. 


2.1.2.2. Artificial Neural Network approach. Another approach is 
Artificial Neural Networks (ANN). The NWP forecasts and further 
meteorological variables are transformed into the wind power or 
speed forecast by ANN which has been trained by the large sets of 
historical data in order to learn the dependence of the output on 
input variables. The general ANN approach for wind speed and 
power forecast is shown in Fig. 3. 


ANN is one of the widely used statistical approaches for wind 
speed and power forecasts. It consists of an input layer, one or 
more hidden layers, and an output layer. Each layer has a number 
of artificial neurons, and it uses a connectionist approach to 
connect the neurons to the neurons of the previous layer. This 
approach is able to model the complex non-linear relationship 
between the input and output layers through a training and 
learning process. This approach does not require explicit mathe- 
matical expressions as used in the physical and statistical 
approaches reviewed previously. Furthermore, it has the ability 
of self-learning, self-organizing and self-adaption. 

In [47], the ANN methodology using the Markov Chain 
approach is proposed for very short-term wind speed forecast. 
The Markov Chain is applied to modify the preliminary forecast 
obtained from ANN according to the long-term patterns. A data set 
with 2.5s resolution is used to evaluate the performance. The 
results show the effectiveness of the proposed method. 

The ANN methodology using Particle Swarm Optimization 
(PSO) is proposed for short-term wind power forecast in [48]. 
PSO is employed to select the input variable from the several 
nearby locations. The proposed method shows the reduction of the 
forecast error comparing with ANN. 

Furthermore, the forecast method composed of a new 
Enhanced Particle Swarm Optimization (EPSO) technique and 
Modified Hybrid Neural Network (MHNN) is proposed to have 
high learning capability and avoid the overfitting problem and 
trapping in local minima and dead bands in [49]. The effective 
two-stage feature selection technique, enhanced Mutual Informa- 
tion (MI), is applied to select the most informative candidate 
inputs by filtering out both irrelevant and redundant candidate 
inputs for the forecast method. The results confirm the validity of 
the developed approach. 

In [50], the forecast method based on Ridgelet Neural Network 
(RNN) is proposed as the forecast engine. The inputs of the forecast 
engine are selected by the Mutual Information (MI) based feature 
selection component among the set of candidate inputs. A New 
Differential Evolution (NDE) algorithm with novel crossover opera- 
tor and selection mechanism is proposed to train the RNN. The 
results show the effectiveness of the proposed method. 
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Fig. 3. ANN approach for wind speed and power forecast. 


In [51], a comprehensive comparison study of typical three 
types of ANN approach is performed; Feed Forward Back- 
Propagation (FFBP), Radial Basis Function (RBF), and Adaptive 
Linear Element (ADALINE) approaches. It is found that different 
inputs and learning rates, as well as model structures, directly 
influence the forecast accuracy. Furthermore, none of the 
approaches outperform others universally based on the evaluation 
criteria even for the same wind dataset. 

Complex-Valued Neural Network (CVNN) is proposed for the 
wind power forecast in [52]. The CVNN used the wind vector (both 
wind speed and direction) rather than real-valued data as an 
input. The results show the improvement in the forecast accuracy 
than Real-Valued Neural Network (RVNN). 

In [53], an ANN approach in conjunction with the adaptive 
Bayesian learning and the Gaussian process approximation is 
proposed for very short-term wind power forecast. The Bayesian 
framework is employed updating to calculate the posterior prob- 
ability distribution for the weights of the model. The Gaussian 
process approximation makes the calculation of this posterior 
probability possible to solve an integration problem of the Baye- 
sian learning. The Bayesian framework provides not only point 
forecast but also the interval about the forecast. 

A three-layered Feed-forward NN trained by the Levenberg- 
Marquardt algorithm is considered to forecast the wind power 
sub-series by Wavelet Transform (WT) in [54]. The obtained 
forecast sub-series reconstruct the future wind power series by 
inverse WT. The proposed approach is compared with the refer- 
ence to demonstrate its effectiveness regarding forecasting accu- 
racy and computation time. Similarly, in [55], Adaptive Wavelet 
Neural Network (AWNN) is carried out to forecast the wind speed 
in stage-I. Then, Feed-forward Neural Network (FNN) is used to 
transfer the wind speed forecast to the wind power forecast in 
stage-Il. 

In [56], standard multilayer Feed-forward Neural Network 
(FNN) is employed to forecast the wind speed. The non-linear 
and non-stationary wind speed series is first decomposed into a 
finite and often small number of Intrinsic Mode Functions (IMFs) 
and one residual series to obtain the stationary data series using 
the Empirical Mode Decomposition (EMD). FNN is performed to 
forecast the decomposed sub-series except the high frequency, 
whose input variables are selected by Partial Autocorrelation 
Function (PACF). Finally, the forecast sub-series are summed to 
reconstruct the forecast of wind speed series. The results show 


that the proposed approach outperforms the reference. Further- 
more, a similar approach based on ANN using EMD is found 
in [57]. 

Generalized Feed-forward Neural Network (GFNN) is used to 
forecast the annual wind speed probability density distribution in 
[58]. The same parameters required by the Weibull function are 
used as inputs. The proposed method shows better accuracy to 
forecast the wind speed probability density distribution than the 
Weibull function. 

Multilayer Feed-forward Neural Network (MFNN) is used to 
forecast a day-ahead wind power forecasting in [59]. The NWP 
model is established by coupling the Global Forecasting System 
(GFS) with the Weather Research and Forecasting (WRF) System. 
Furthermore, the Kalman filter is integrated to reduce the sys- 
tematic error from WRF. The proposed method is proved to be 
successful by the simulation. 

In [60], Kernel Ridge Regression (KRR) with active learning is 
performed for wind speed forecast. The active learning acts as a 
smart sample selector capable to yield compact forecast models 
and also to filter out noisy training samples. Three active learning 
methods to construct the training set for KRR are proposed; Pool 
of regressors (PAL), Distance from the closest training sample 
(DAL), and Residual regression (RSAL). The experimental results 
show that smart collection of training samples can be of benefit for 
wind speed forecast. 

ANN has been a good selection for wind speed and power 
forecasting. ANN is able to model a complex non-linear relation- 
ship and extract the dependence between the input and output 
through the learning process. It is simple to construct and only 
requires short development times, and does not require explicit 
mathematical expressions. ANN can be designed by setting up the 
network structure and then selecting a learning algorithm. 

The first step with ANN is to choose the network structure, 
where network structures derive from two basic classes: 1 - A 
feed-forward topology where information flows in only one 
direction, from inputs to outputs; 2 — A recurrent topology where 
information flows in both directions, from input to output and also 
from output to input. 

The second step with ANN is to select the learning algorithm 
and to learn the proper response. This is achieved with three 
learning paradigms: 1 - Supervised learning that sets ANN para- 
meters from training data; 2 - Un-supervised learning that 
sets ANN parameters based on given data and a cost function; 3 
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- Reinforcement learning that sets ANN parameters, where data is 
generated by interactions with the environment. Each learning 
paradigm has many learning algorithms. Some commonly used 
algorithms include evolutionary methods, gene expression pro- 
gramming, simulated annealing, expectation-maximization, non- 
parametric methods, and particle swarm optimization. 


2.1.2.3. ANN-Fuzzy approach. The Fuzzy logic approach is a non- 
linear mapping of input variables into the output by using 
numerical and linguistic values Fuzzy rather than fixed and 
exact. It used soft linguistic variable (small, medium, and large) 
and a truth variable that ranges in degree between 0 and 1. The 
Fuzzy logic approach is employed when the system is difficult to 
model accurately but an inexact model is available, because it 
allows to use the approximate values and incomplete or 
ambiguous data. However, using the Fuzzy logic approach alone 
is not satisfying because of its weak learning ability. 

The ANN-Fuzzy approach refers to combination of ANN and 
Fuzzy logic approaches. ANN is based on the learning and 
connectionist structure that performs well in low-level computa- 
tion with raw data. However, Fuzzy logic performs well with the 
human-like reasoning in high-level computation. Therefore, the 
combination of two approaches offers a promising approach to 
forecast the wind speed and power by compensating their weak- 
ness each other. 

In [61], ANN is combined with Fuzzy logic approach in order to 
optimize the best use of the NWPs. First, the ANN model provides 
the preliminary forecast of wind power based on NWPs. Then, the 
Fuzzy model estimates the quality of the forecasts provided by 
NWPs. Finally, this is exploited by an ANN model to provide the 
final forecasts. The simulation results show the validity of the 
proposed method. 

The hybrid Wavelet Transform (WT)-Particle Swarm Optimiza- 
tion (PSO)-Adaptive Network based Fuzzy Inference System 
(ANFIS) approach (hybrid WPA approach) is proposed for short- 
term wind power forecasting in [62]. The wavelet transform is 
performed to convert a wind power series in a set of constitutive 
sub-series. The transformed sub-series are separately forecasted 
by the ANFIS. The PSO is used to adjust the parameters of the 
membership function in ANFIS. The hybrid WPA approach outper- 
forms seven other reference approaches. 

The Simultaneous Perturbation Stochastic Approximation 
(SPSA)-based Multilayer Feed-forward Neural Network (MFNN) 
with Fuzzy inputs algorithm is presented in [63]. The SPSA is 
employed to achieve the training task of the MFNN. The inputs for 
the MFNN are modeled by triangle fuzzy numbers. A better 
performance of the proposed method is shown than the reference 
method. However, it cannot be used for multistep forecasting. 

ANN-Fuzzy approaches are complementary tools in building 
intelligent forecasting systems. While ANN performs well in low- 
level computational structures, Fuzzy logic deals with reasoning 
incorporated in the higher-level computational structures. Mer- 
ging of the ANN approach and the Fuzzy logic approach into one 
integrated system offers a promising approach to building wind 
speed and power forecasting systems. 


2.1.2.4. Etc. A Gaussian-Process-based method for forecasting the 
upper and the lower bounds of wind speed as well as the average 
is proposed in [64]. It makes use of the kernel machine technique and 
the Bayesian estimation. The simulation results show that they 
provide the better forecasting in consideration of the uncertainty 
than the reference. Furthermore, the advanced Gaussian-Process- 
based method, called the Sparse Heteroscedastic Gaussian Process 
(SPHGP) is proposed in [65]. This method is developed to make up the 
misestimation of the noise level from standard GP and reduces the 


computational costs from HGP. The 71,2 regularizer is employed for 
the sparsification. The simulation results demonstrate the effective- 
ness of the proposed method. 

In [66], the modified Taylor Kriging (TK) method for forecasting 
wind speed time series is proposed. The original TK method is 
developed for the spatial estimation. In this paper, the original TK 
method is properly modified for the time series forecasting. The 
simulation results indicate that the proposed method outperforms 
the ARIMA method. 

The Grey forecasting model (GM) is presented for short-term 
wind speed forecast in [67]. The Grey model is designed for the 
system analysis being characterized by inadequate information. 
The GM(1,1) Grey model, a single variable first-order Grey model, 
is used. The simulation results demonstrate the efficiency of the 
proposed model. 

In [68], two-stage hybrid network with Bayesian Clustering by 
Dynamic (BCD) and Support Vector Regression (SVR) is proposed. 
First, BCD clusters the input training dataset into several subsets 
with similar dynamical properties. Then, SVRs is used to model the 
training data in each subset having similar property. Experiments 
and comparisons with the persistence method demonstrate the 
effectiveness and efficiency of the proposed method. 

The forecasting of three-dimensional wind field is considered 
in [69]. To exploit the available second order statistics, the 
augmented quaternion statistics is employed and the adaptive 
Widely Linear Quaternion Least Mean Square (WL-QLMS) algo- 
rithm for the forecasting is used to allow for both three- 
dimensional wind model and the fusion of atmospheric para- 
meters. It is shown that additional information from the vertical 
wind speed and atmospheric parameters assist in providing better 
forecast. 

In [70], the Bayesian Model Averaging (BMA) approach with 
Markov Chain Monte Carlo (MCMC) is proposed for developing the 
long-term wind speed distribution. The derived BMA probability 
density function (PDF) is an average of many candidate models 
included in the model space weighted by their posterior prob- 
abilities over the sample data. The case studies reveal that none of 
reference distribution outperforms others; however, the BMA 
distribution is always suitable to describe the wind speed dis- 
tribution with high accuracy. 

The forecasting model integrating the concepts of structural 
breaks and Bayesian inferences is proposed for very short-term 
wind speed forecasting in [71]. It allows prior information about 
the wind speeds to be incorporated into the model and somehow 
boosts forecasting performance by considering structural breaks. 
The computational results confirm the advantages of the proposed 
method. However, there are several limitations; for example, the 
computing efficiency is not high and the size of training sample is 
not large enough. Further research is needed to cure these 
limitations. 

In [72], First-order and Second-order Adaptive Coefficient (FAC 
and SAC) methods are used to wind speed forecasting. Particle 
Swarm Optimization (PSO) is employed to select the most suitable 
parameter of FAC and SAC methods. Furthermore, Seasonal Expo- 
nential Adjustment (SEA) is employed to decompose the wind 
speed data into seasonal and trend components. Different combi- 
nations of these techniques are simulated. The combination 
obtains higher forecasting accuracy than the single FAC and SAC. 

Data mining approaches in wind energy system have more 
attention in recent years [12]. The time series models are built 
with data mining algorithms for the short-term forecasting in [73]. 
Five different data mining algorithms are employed; the Support 
Vector Machine Regression (SVMreg), Multilayer Perceptron 
(MLP), M5P tree, Reduced Error Pruning (REP) tree, and the 
bagging tree algorithms. The computational results of all time 
series models and data mining algorithms are compared and 
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discussed. Furthermore, this approach is applied to forecast the 
wind power ramp rates in [74]. One of the disadvantage of the 
proposed approach is that the time series model uses its own 
previously forecasted value. Therefore, the forecast accuracy 
decreases when the number of forecast steps increase. In [75], a 
data mining approach is expanded to forecast both short-term and 
long-term period. Furthermore, an advanced model, the ANN 
approach, is employed. The simulation results show that the 
model generated by an ANN outperforms all other models for 
both short-term and long-term forecast. 

The frequency domain approach is introduced for characteriz- 
ing the wind speed patterns in [76]. This approach separates the 
discrete waveform into a sum of sinusoids of different frequencies 
from wind speed patterns. It shows a potential to forecast future 
wind speed patterns with better accuracy than time domain. 
Furthermore, it also shows a potential to estimate the wind speed 
patterns for a target location using available data from the 
reference site. It is necessary to develop the appropriate forecast- 
ing system using the components obtained from the frequency 
domain approach. 


2.1.3. Combination approach 

The basic idea of the combination approach is to combine 
different approaches, retaining advantages of each approach. The 
desire is to improve the forecast accuracy, but combining forecasts 
does not always perform better than the best individual forecasts. 
However, in some cases it is viewed as less risky to combine 
forecasts than to select an individual forecast [77]. In this section, 
the combination forecasting method is only reviewed. The combi- 
nation of input data (NWPs) is discussed in Section 4. 

Several approaches have been developed based on the combi- 
nation of various models. The ANN-Fuzzy approach discussed in 
the previous section is one of the combination approaches. In [78], 
the final forecast is made through an adaptive linear combination 
of the alternative forecasts, where the weights given to each 
forecasting methods are based on their forecasting performance. 
For the combination, nine different time series models are con- 
sidered by varying the model parameters. The proposed combina- 
tion shows better performance than individual forecasts. 

The Anemos project considers the combination method, called 
Adaptive Exponential Combination (AEC) in [79]. In the first step, 
several combination methods are used, AEC being one of them. In 
the second step, the AEC method is used to combine the alter- 
native combinations of the first step. The results show the 
usefulness of the proposed method. 

In [80], a Multiple Architecture System (MAS) for wind speed 
forecasting is proposed. The different regression algorithms are 
considered as candidate models for making up the ensemble 
forecast; the Multiple Linear (MLR)-based regression, the Multi- 
layer Perceptron (MLP) neural networks, Radial Basis Functions 
(RBF) neural networks, and Support Vector Machines (SVM) 
regression. Three combination strategies are explored; simple 
average, weighted average, and non-linear fusion by means of an 
ANN method. The proposed combination strategies improve the 
performance with respect to the single forecast. 

In [81], two types of univariate statistical forecasting models 
based on the Least Squares-Support Vector Machine (LS-SVM) are 
proposed: the univariate LS-SVM model and the hybrid model of 
ARIMA and LS-SVM model. The hybrid approach is designed to 
capture both the linear ARIMA modeled and the non-linear 
patterns modeled by LS-SVM. However, there are no significant 
improvements between LS-SVM and the hybrid model. 

The combination approach between the Radial Basis Function 
(RBF) neural networks and persistence methods is considered in 
[82]. The experimental results show that the RBF method is better 


for monotonically wind speed changes while the persistence 
method is more suitable for random data with white noise. The 
simulation results demonstrate improvements in the combination 
approach. 

In [83], two combination approaches, namely ARIMA-ANN and 
ARIMA-SVM, are proposed. The ARIMA method is employed for 
modeling the linear characteristics and the ANN or SVM methods 
are employed for modeling the non-linear characteristics. The 
results show that the combination is viable for forecasting both 
wind speed and wind power. However, the combined approaches 
do not always outperform the single method. A similar combina- 
tion approach is also presented in [84]. 

In [85], two combination approaches are proposed; ARIMA- 
ANN and ARIMA-Kalman. ARIMA is used to decide the best 
structure of the ANN model and to obtain the best initial 
parameters of the Kalman model. Both combination approaches 
have good performance, which can be applied to non-stationary 
wind speed forecasting. 

The Bayesian Model Averaging (MBA) approach is proposed to 
combine the wind speed forecasts obtained from different ANN 
models in [86]. Three different ANN models are considered; 
Adaptive Linear Element (ADALINE) network, Back-Propagation 
(BP) network, and Radial Basis Function (RBF) network. The BMA 
approach weights individual forecasts based on their posterior 
model probabilities, with the better performing forecasts receiving 
higher weights than the worse. The simulation results show the 
effectiveness of the proposed combination approach. 

In [87], the combination of the Soft Computing Models (SCMs) 
and Similar Days (SD) methods is considered. Three SCMs models 
are considered, including Back-Propagation Neural Network 
(BPNN), Radial Basis Function Neural Network (RBFNN), and 
Adaptive Neuro-Fuzzy Inference System (ANFIS). The average of 
a selected number of similar wind speed days from SD methods is 
used for the input of SCMs and the application of SCM refines the 
results to obtain the final wind speed forecast. The simulation 
results show that all evaluated SCMs incur some level of perfor- 
mance improvement with the addition of the SD model. Further- 
more, the SD-ANFIS model outperforms all other individual and 
combined models. 

The combination between Seasonal ARIMA (SARIMA) and Least 
Square Support Vector Machine (LSSVM) is developed to forecast 
the average monthly wind speed in [88]. The SARIMA model is 
developed by using training data and the LSSVM model is devel- 
oped to describe the residuals from the SARIMA model. The 
simulation results show that the proposed method is simple and 
efficient. 

The main objective of the combination approach is to improve 
performance by taking advantage of the strengths of each model- 
ing approach. Combinations can consist of physical and statistical 
or alternative statistical approaches. The advantage of combining 
results from different modeling approaches is improved forecast 
accuracy. Furthermore, the combination of models reduces risks 
during extreme events such as storms, where some model types 
may have significant forecasting errors. 


2.2. Spatial correlation forecasting 


Typically, wind speed and power forecasting refers to the 
future, and the forecast horizon depends on power system opera- 
tion requirements. Spatial correlation forecasting, on the other 
hand, is mostly used for characterizing the wind resource at the 
site where sufficient information is not available but a neighboring 
measuring station is available. It is a useful indicator when 
performing wind energy potential assessments at sites without 
wind measurements. 
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Several approaches can be found regarding the problem of 
spatial correlation forecasting. The most commonly used approach 
is the Measure-Correlate-Predict (MCP) method [89]. It is widely 
used for future wind integration studies. The method is used for 
the long-term wind data at the target site using the wind data 
obtained from the reference site. Four different MCP methods are 
introduced: Linear Regression, Matric MCP, Weibull Scale, and 
Wind Index MCP model. It is found that all perform well, but the 
performance is of course dependent upon the condition of the 
available data. 

Furthermore, in [90], the MCP method is used to forecast the 
long-term wind speed and then, the Monte Carlo based numerical 
simulation is performed to utilize the probability models at the 
target site. This model is used for assessing the uncertainty of 
overall wind energy potential prior to the construction of wind 
turbines. For the case study it was found that the method 
effectively evaluated the expected annual energy production at 
the site. 

The Bayesian hierarchical model is developed to characterize 
the wind resource at the desired location using the known 
resource [91]. The hierarchical model has two levels. The wind 
speed data from the reference site is defined as the sum of a 
temporal, a spatial, and error components together at the first 
level. At the second level, a temporal and a spatial component are 
modeled as a first order random walk and a multivariate normal 
distribution respectively. It is found that the Bayesian inference 
can be a useful tool in spatial correlation forecasting. 

In [92], the wind speed at “upwind” remote sites can be used 
for forecasting at “downwind” sites. The ANN approach is used to 
forecast the wind speed at the target site using the “upwind” and 
“downwind” site measurements. It is found that the short-term 
wind speed forecasting is improved when the recent measure- 
ments from the reference site are used. 

In [93], the ANN approach is used to estimate the wind speed at 
the target site using the wind speed at a strong correlation site 
among nearby sites. A high value of the Sample Cross Correlation 
Function (SCCF) indicates the strongest correlation site. The 
simulation results indicate that higher SCCF values between two 
sites leads to better wind speed estimates. However, this method, 
contrary to other methods, requires the measurements from both 
the target and reference sites to calculate their SCCF. 

A Takagi, Sugeno, and Kang (TSK) Fuzzy approach is proposed 
to forecast the wind speed and power at the target site using the 
wind speed and direction measured at neighboring sites in [94]. 
The Genetic Algorithm (GA) based learning scheme is employed to 
achieve the training task of the model. The performance is 
evaluated at various terrain cases and demonstrates significant 
improvements over the persistent model. 

In [95], a method for long-term wind speed and power 
forecasting is proposed by using the NWPs provided at nearby 
sites of the wind park. Three types of local recurrent neural 
network models, depending on the neuron dynamic model, are 
employed: the Infinite Impulse Response Multilayer Perceptron 
(IIR-MLP), the Local Activation Feedback Multilayer Network (LAF- 
MLN), and the Diagonal Recurrent Neural Network (DRNN). 
Furthermore, two learning schemes for updating the weights of 
the model are employed; the Global Recursive Prediction Error 
(GRPE) and the Decoupled RPE (DRPE). It is shown that the 
proposed methods outperform the static and the persistence 
models. 

The ANN approach is used to estimate the wind speed at a 
target station in [96]. The wind speeds recorded at neighboring 
measuring stations are used as signals for the input layer of the 
model. It is found that as the number of reference stations 
increase, the estimation errors tend to decrease. Furthermore, 
when the wind direction is used as an input, greater error 


reduction is achieved than when the wind direction signal is 
omitted. 

Being different from traditional wind speed and power fore- 
casting, spatial correlation forecasting takes into account the 
spatial relationship of wind speed and power from different sites. 
The spatial correlation forecasting requires wind speed and power 
measurements from multiple spatial correlated sites, where the 
measurements often involve time delays. The measurements and 
their time delays adds complexity and cost to the implementation 
of spatial correlation forecasting. Spatial correlation forecasting 
can either employ physical models, which take into account the 
terrain information, or statistical models, which take spatial 
correlation information into account. 


2.3. Regional forecasting 


Regional forecasting is to forecast aggregated power output 
from a region where a number of wind turbines are sited. The 
regional forecasting is required for the market operator to estimate 
the total wind power production in a region. It provides a faster 
approach than calculating the sum of the wind power forecast 
from each wind farm. Furthermore, it is more accurate because of 
its spatial smoothing effects discussed in Section 4. Several 
approaches are proposed for regional forecasting. 

Linear up-scaling in Previento is used for the regional forecast- 
ing in [36]. The ratio between the fluctuation of the wind power 
output of the measured single site and the ensemble of the sites is 
used for up-scaling. It shows how this method works practically in 
Germany. 

In [97], the aggregated forecasting method using the distances 
between wind speed forecasts for a set of selected coordinates to 
calculate wind power forecasts in a region is proposed. Once the 
distance is obtained, the wind power forecast is calculated by 
comparing the new wind speed forecast with the stored historical 
wind speed forecasts and their corresponding total wind power 
measured in the region. The accuracy of the proposed method is 
verified by comparing the results with the reference method. 

In [98], an ANN-Fuzzy approach based on the up-scaling 
approach is used to forecast the regional wind generation. Three 
different approaches are introduced based on how much the 
NWPs and on-line SCADA data are available: FNN up-scaling, 
Cascaded model, and FNN cluster model. The results show that 
the overall performance is improved but it is difficult to beat the 
persistence model for the first look-ahead time. SCADA data is 
necessary to enhance the performance in the first forecast horizon. 

A finite state Markov Chain model is developed to forecast the 
aggregated power output from a wind farm in [99]. The model 
takes into account both the spatial and temporal dynamics of wind 
power output. For spatial dynamics, a rigorous step-by-step 
procedure to characterize the probability distribution of the 
aggregated power output is presented. The temporal dynamics is 
characterized using auto-regression analysis tools. 

The models of regional forecasting are mostly based on the up- 
scaling approach. It is difficult to obtain the wind power measure- 
ments and NWPs from all the wind farms in a region. Therefore, 
the up-scaling approach is used to scale the available on-line 
measurements and NWPs up to the region. There is not much 
work reported dealing with regional forecasting. When wind 
energy penetration increases significantly good solutions to regio- 
nal forecasting will be needed. 


2.4. Probabilistic forecasting 
The forecasting models discussed so far provide the point 


forecast for how much wind speed and power can be expected 
for a number of hours into the future. It is important to know not 
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only the point forecast but also its expected uncertainty. Knowing 
the uncertainty of the forecast enables the system operator to 
assess the risk of the point forecast. Different approaches can be 
found in the literature regarding the problem of probabilistic 
forecast. In this section, some of the representative methods are 
discussed. 


2.4.1. Parametric approach 

If there is any underlying assumption on the distribution one 
tries to model for the forecasting error, it is a parametric approach. 

A Beta Probability Density Function (PDF) is used to model a 
PDF of the wind power forecast for the persistence model in [100]. 
First, the forecast results are divided into power classes or bins for 
modeling the distribution of power with the Beta PDF. The overall 
forecast error PDF is obtained from adding up the calculated error 
PDFs of all bins. The results show the usefulness of the proposed 
forecast error PDF for finding the optimal storage size. 

The Cauchy distribution is proposed as a model distribution for 
the forecast error for the persistence model in [101]. The shape of 
the distribution is found to change significantly with the forecast- 
ing horizon. The effect of the proposed wind power forecast 
uncertainty is illustrated. 

There is little literature available on the parametric approach 
due to the difficulty of defining the stochastic nature of wind 
speed. 


2.4.2. Non-parametric approach 

Contrary to the parametric approach, the non-parametric 
approach does not rely on any assumption about the distribution. 

A non-parametric approach based on kernel density estimation 
is proposed to provide the complete wind power forecast dis- 
tribution in [102]. The distribution of wind power is decomposed 
into a continuous part corresponding to all occurrences of wind 
speed having zero production and a discrete part corresponding to 
the discrete probabilities of all production values. The method 
levels with the references while being fast and producing the 
complete PDF. 

The kernel density estimation is expanded to a time-adaptive 
model and to adopt different kernels for several types of variables 
in [103]. The kernel density estimation model based on the 
Nadaraya-Watson estimator is proposed to estimate the uncer- 
tainty in short-term wind power forecasts. It includes the adoption 
of specific kernels for the explanatory variables and the develop- 
ment of a time adaptive model. It confirms its value from testing 
with real data from two large wind farms in the U.S. and shows 
better results based on the evaluation criteria in [104]. Further- 
more, kernel density estimation based on the time-adaptive 
quantile-copula estimator is presented in [105]. 

In [106], the generic methodology for assessing the forecast risk 
is presented. First, confidence intervals based on the resampling 
approach are calculated by considering the forecast horizon, the 
power class and the cut-off risk. Furthermore, the Meteorological 
Risk Index (MRI) evaluating the weather stability is used to tune 
the confidence intervals. The proposed method is generic and can 
be applied to any wind forecasting models. However, it does not 
seem to have very much influence on the quantile estimation in 
[107]. The most influential variable is the forecasted wind power 
production. Furthermore, the results show increased uncertainty 
for westerly winds. Also, the effect of the horizon on the quantiles 
is minor. 

Probabilistic wind power forecasts using Local Quantile Regres- 
sion (LQR) is proposed in [108]. This approach does not require the 
distribution assumption and is easy to include more predictive 
information. The simulation results demonstrate its effectiveness. 


In [109], a probabilistic forecasting method based on the 
Markov Chain model is proposed. This provides not only the 
estimates of future wind power generation but also the associated 
probability distributions. First Order Markov Chain (FOMC) and 
Second Order MC (SOMC) are considered. The proposed models 
are compared with the persistence model to evaluate forecast 
errors. 

In [110], the proposed probabilistic model is designed to 
forecast eight quantiles of the wind power distribution using 
multiple Radial Basis Function Neural Networks (RBFNNs). The 
main idea is to exploit all information influencing the forecasting 
errors in order to estimate the wind power density function. 
The three uncertainties considered are the weather stability, the 
structure of the forecasting model, and the NWP accuracy. The 
developed RBFNNs are trained with the Ordinary Orthogonal Least 
Square (OLS) algorithm and their performance is improved using a 
Particle Swarm Optimization (PSO) algorithm. The simulation 
results show that the proposed method performs well in different 
weather conditions and different terrains. 

In [111], the sparse Warped Gaussian Process (WGP) model is 
formulated to provide short-term probability forecasts of the wind 
power generation. It converts a non-Gaussian wind power series 
to a latent series, which is well modeled by a Gaussian process. 
Furthermore, a sparsification method is employed to reduce the 
computational costs of the model. The simulation results validate 
the effectiveness of the proposed model. 

Non-parametric approaches are used when it is not possible to 
formulate the distribution of the forecast errors. This distribution- 
free approach is appealing since it is difficult to define the 
stochastic nature of wind. Furthermore, the wind speed and power 
forecast are very different depending on the forecast time horizon 
and location. Non-parametric approaches can be applied in this 
case, and as a result are suitable to estimate the uncertainty of the 
wind speed and power forecast. 


2.4.3. Using ensemble forecast 

Ensemble forecasting methods are designed to take advantage 
of ensemble NWPs. The ensemble NWPs are obtained by running 
differently calibrated NWP models or by slightly varying the initial 
conditions, in order to obtain a set of different forecasts [112]. 
Different types of meteorological ensemble forecasts are provided 
either by the European Centre for Medium-range Weather Fore- 
casts (ECMWF) or the National Centre for Environmental Predic- 
tion (NCEP). A MultiScheme Ensemble Prediction System (MS-EPS) 
is introduced to utilize the ensemble NWPs for short-term wind 
power forecasting [113]. 

In [114], skill forecasting based on ensemble forecasts of wind 
power production is introduced to estimate the expected level of 
forecast uncertainty. Prediction risk indices based on the disper- 
sion of wind power ensembles over a single forecast horizon or 
over a set of successive look-ahead times are introduced. Then, 
these indices are used to forecast the distribution of expected 
forecast errors as skill forecasts. The simulation results show that 
the proposed skill forecasting can provide useful information on 
the expected level of forecast uncertainty. 

The wind power density forecasts using the calibrated and 
smoothed ensemble forecast are proposed in [115]. The systematic 
bias of the ensemble forecasts is corrected by the calibration 
approach that also incorporates kernel smoothing, with para- 
meters optimized using maximum likelihood. The proposed power 
density forecasts provide more accurate results for wind power 
density and point forecasting than the reference. 

The ensemble forecasts give a much better idea of what 
weather events may occur in the future. By comparing different 
forecasts, it identifies the expected spread of weather conditions 


J. Jung, R.P. Broadwater / Renewable and Sustainable Energy Reviews 31 (2014) 762-777 771 


and assesses the probability of particular weather events. When 
the forecasts vary significantly there is a lot of uncertainty. 
However, if forecasts are very similar, then more confidence can 
be placed in the forecasts. The ensemble forecast is useful in both 
wind speed and power forecasting. 


2.4.4. Additional models 

In [116], the relation between typical weather situations and the 
corresponding wind speed forecast error is investigated to model 
the uncertainty of the forecast. First, the Cluster analysis using the 
Principal Component Analysis (PCA) rather than complex meteorolo- 
gical information is used to group days with similar meteorological 
conditions into classes. Then, for each of these groups the forecast 
error is determined separately. It is found that different meteorological 
situations have significant differences in the forecasting error. 

In [117], it is necessary to assess both temporal and spatial 
interdependences in order to evaluate the aggregate error for the 
regional uncertainty estimates. In this manner, the spatial model- 
ing of the wind power forecast uncertainty is proposed, which 
is important for the optimal management of power flows in 
the large region. The improvement of the probabilistic forecasting 
is discussed. 

The spatio-temporal modeling of wind power forecast uncer- 
tainty considering the interdependence of both time and space is 
proposed in [118]. It is shown that there is significant cross- 
correlation between forecast errors obtained from neighboring 
areas with lags of a few hours. These cross-correlation patterns 
characterized by wind speed and direction show that a higher 
prevailing wind results in a stronger dependency on remote 
places; while a lower prevailing wind results in a stronger 
dependency on local places. A further study is needed for the 
generalization by the methodology. 

In [119], the uncertainty of wind power forecasting regarding 
the meteorological situation is investigated. From the simulation 
results, the uncertainty of wind speed forecasts is not dependent 
on the magnitude of the wind speed forecast. Furthermore, the 
uncertainty is larger for low pressure situations than high pressure 
situations. However, some sites do not show these differences. 

A framework for evaluating the quality of probabilistic fore- 
casting of wind power is presented in [104]. The framework is 
composed of measures and diagrams, with the aim of providing 
useful information on each of the properties including reliability, 
sharpness and resolution, and a unique skill score. The framework 
is applied for comparing the quality of two non-parametric 
methods; adaptive quantile regression and adapted re-sampling. 
The relevance and interest of the proposed framework are 
discussed. 

In [120], a methodology for using the variability in wind speed 
forecasts at multiple grid points in NWP systems is presented to 
forecast the uncertainty in future wind power generation. The 
basic concept is that a major source of uncertainty in NWP 
forecasts arises from potential misplacement of weather features. 
The comparison between the proposed method and NWP ensem- 
bles is performed. It demonstrates good accuracy in forecasting 
large forecast errors. 

A wide comparison is carried out between the ARMA models, 
five kinds of ANN models, and Adaptive Network based Fuzzy 
Inference System (ANFIS) models in [121]. The comparison looks at 
many forecasting methods, time horizons and a performance 
analysis. A further analysis of the statistical distribution of normal- 
ized errors from the forecasting models is performed. 


2.5. Offshore forecasting 


Nowadays, offshore wind has a lot of attention and is expected 
to be installed more in the future because of more frequent and 


powerful winds. However, the offshore wind forecasting is still in 
the early phases of the development. There are not many available 
specific forecasting models for offshore in the literature. Further 
advances in offshore forecasting will help offshore wind penetra- 
tion levels. 

Most forecasting models have been designed for onshore. For 
the physical approach to offshore forecasting, the special meteor- 
ological characteristic of the marine atmospheric boundary layer 
has to be considered including the stability of the boundary layer, 
wind-wave interaction, etc., [122]. However, the statistical 
approach does not need to consider the precise offshore condi- 
tions. Results from various offshore forecasting methods are 
investigated in [123]. 

In [124], the evaluation of NWPs forecasts for offshore sites is 
performed. Then, for the physical forecasting approach, the special 
meteorological characteristics of the marine boundary layer are 
considered to calculate the wind speed at the hub height of the 
wind turbines by using the derived vertical wind profile. The 
effectiveness of the proposed method is demonstrated by compar- 
ing with measurements. 

In [61,114,125,126], some of the statistical approaches use 
offshore data as an input, but the specific technique for dealing 
with offshore data is not discussed. 


3. Accuracy of wind speed and power forecasting 
3.1. Reference models 


A number of reference models have been introduced as bench- 
mark models. These models may be used for initial testing of the 
accuracy of new approaches to wind forecasting. A number of 
existing reference models are reviewed in this section. 


3.1.1. Persistence model 

The persistence model is a commonly used benchmark model. 
In this model the future wind power will be the same as occurred 
in the present time step as given by 


Pre = Pe (2) 


where P, +k İS the forecast at time t for the look ahead time k and 
P; is the measurement at time t. This model performs well for 
short forecasting horizons. 


3.1.2. Weighted sum between persistence and the mean power 
production 

The persistence model performs well for the short-term 
forecast. However, it is not reasonable to use the persistence 
model when the forecast time horizon is more than a few 
hours. For this case, a new reference model has been proposed 
[127]. It is a weighted sum between the persistence and the mean 
given as 


Pra kt = akPt+ (1 — ap)P (3) 


where P, is the measurement at time t, P is the estimated mean, 
and a, is the correlation coefficient between P, and P;.,, defined as 


C/N) ON HP Pek 
ak = -7 
(1/N)SN-*P, 


t=1 


(4) 


where 
P,=P;—P (5) 
3.1.3. OL-Persistence model 


For regional forecasting, total regional power is seldom avail- 
able. Therefore, the On-Line (OL) Persistence model is introduced 
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for the regional forecasting as a benchmark model [32]. This model 
is the sum of the production of the representative wind farms with 
measurements, scaled to the total wind power and is given as 
areg pres Mr 

Pecae= Saag Pie (6) 


M pwf . 
iP insta i= 1 


where P;"* is the total installed capacity in the region, Pe is the 
installed capacity of the M, wind farms for which on-line mea- 
sures are available, and p% is the power measured at those wind 


farms at time t. 


3.2. Model performance evaluation 


To determine the model performance in terms of the forecast- 
ing accuracy, several model accuracy measures are discussed in 
this section. 

The forecast error is the difference between the measurement 
and the forecast and is given as 


etyre = Peyr — Êro rt (7) 


Sometimes, the normalized forecast error is used. The forecast error 
is normalized by the installed capacity (Pinst) as 
Le 8 
E = 
t+kit Pinst t+kit 

To assess the quality of the forecast, several evaluation criteria 
are used; Bias, Mean Absolute Error (MAE), Mean Absolute Per- 
centage Error (MAPE), Mean Square Error (MSE), Root Mean Square 
Error (RMSE), and Standard Deviation of Errors (SDE) [128,129], as 
given by 


P — IN 
Bids, =AV8(e + it) =N x Cre kt (9) 
t= 
1 N 
MAE: =Ñ È Jerse] (10) 
t=1 
1 N lecy nie 
MAPE, = — 11 
r N> Ptk ( ) 
1 N 2 
MSE =Ñ È (erko) (12) 
t=1 
1N 2 
RMSE; = V MSE; = N > (€¢+- kit) (13) 
t=1 


1 N = 
SDE, = Std(ey + kit) = N 2 (t+ Kt — e? (14) 
\VN;= 


where N is the total number of samples, and Avg(-) and Std(-) 
denote the average and the standard deviation respectively. 
Furthermore, the RMSE, can be expressed by the combination 
between Bias, and the SDE, as 


RMSE? = Bias? + SDE? (15) 


In the model comparison, it is important to quantify the 
improvements of the advanced model over the benchmark model. 
In this manner, the improvement with respect to the reference 
model is introduced as [128,129] 


EC pef k = ECaav,k 
E Cref „k 


where EC stands for the evaluation criteria such as MAE, RMSE, 
SDE, etc. 


Impecp = (16) 


3.3. Forecast accuracy 


Using the advanced forecasting models introduced in Section 2, 
it is possible to provide better forecasts of wind speed and power 
than the reference models. However, it is very difficult to say 
which model is the best because of its site dependency. Although 
the forecasting model performs well, it does not guarantee that 
the model works well at another site. Therefore, how the general 
forecast accuracy varies is discussed in this section. 


3.3.1. The accuracy by forecast time horizon 

The forecast accuracy decreases when prediction length 
increases. The MAE of short-term forecasts is typically in the range 
of 5-15% and then the errors increase rapidly with an increase in 
the forecast time horizon. Thus, the MAE is typically in the range 
of 13-21% range for 1-2 days ahead and rises to 20-25% range 
after about 3 days [10]. 


3.3.2. The accuracy by location complexity 

In [79], the average value of the normalized MAE (NMAE) for 
the 12h forecast horizon is compared based on the terrain 
complexity of the wind farm. Much smaller NAME values in a flat 
terrain wind farm (low complexity location) are observed than in 
complex terrains (high complexity location). There is a significant 
increase of the NMAE value when the terrain complexity is 
increased. Furthermore, offshore wind farms have slightly higher 
NMAE values relative to the flat terrain wind farm. 


3.3.3. The accuracy by seasonal variability 

In [130], the performance of the forecasting models is related to 
the seasonal variability. Smaller forecasting errors are observed in 
winter than summer because of the higher level of wind speeds 
and the larger uncertainty in the summer stormy weather situa- 
tions, in particular low pressure systems with fast moving frontal 
zones containing complicated wind patterns. 


3.3.4. The accuracy by weather condition 

In [106], the forecasting error is compared as a function of the 
Meteorological Risk Index (MRI) evaluating the weather stability. 
The forecasting error increases linearly when MRI increases (the 
higher MRI represents unstable weather regimes). Furthermore, in 
[119], the forecasting uncertainty does not depend on the wind 
speed but on the pressure. The forecasting error in the low 
pressure situation is larger than the high pressure situation. These 
patterns are also observed in [116]. 


4. Improvement in forecasting performance 


Development activities are on-going in reducing the forecasting 
error in wind speed and power forecasting, spatial correlation 
forecasting, regional forecasting, probabilistic forecasting, and off- 
shore forecasting. In this section, some of the potential options to 
help improve the forecasting accuracy are discussed. 


4.1. Kalman filtering 


It is well known that the accuracy of NWPs has a major impact 
on the forecasting of wind speed and power. Therefore, reducing 
the uncertainty of NWPs can lead to significant forecasting 
improvements. To improve the accuracy of NWPs, the Kalman 
filtering algorithm is applied to filter out systematic errors. The 
Kalman filtering algorithm provides the statistically optimal esti- 
mate recursively by combining recent weighted observations 
which minimize the corresponding biases. 
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In [131], the improvements in wind speed forecasting with the 
application of the Kalman filtering are investigated. Third-order 
polynomial functions are implemented in the Kalman filtering 
algorithm. For the evaluation, two NWP models having different 
characteristics and horizontal resolutions are tested. The simula- 
tion results show that the forecasting accuracy is significantly 
improved by using the filtered NWPs. The significant forecast 
improvement using the Kalman filtering can be also found in 
[59,132,133]. 


4.2. Optimal combination 


The combination of different NWPs or forecasting models takes 
advantage from each model. The advantage of combining the 
models, of course, improves the forecast accuracy. Furthermore, 
the combination can reduce the risk from extreme events. The 
combination of forecasting methods is discussed in Section 2, and 
the optimal combination can significantly reduce forecast error. 
Here, the optimal combination of NWPs is discussed. 

In [134,135], a classification scheme is used to find optimal 
combinations for specific weather situations. The different NWP 
models are exploited by applying the optimal weights depending 
on the prevailing weather conditions. It shows that the optimal 
combination of NWPs significantly outperforms others even if the 
single NWP is already very well optimized. The combination of 
different NWPs is performed based on the quality of individual 
forecasts in [125,136]. 

In [137], the combination between measured data and multiple 
wind forecasting models is proposed by advanced machine learn- 
ing techniques. The input sources are the wind forecasts profiles, 
derived by synoptic and local atmospheric models, and the actual 
climatic variable evolution measured by local environmental 
stations. The simulation results demonstrate the effectiveness of 
the proposed data fusion algorithms. 


4.3. Spatial smoothing effects 


Under operational conditions, regional forecasting is required 
for the system operator to estimate the total wind power produc- 
tion. The fluctuations of the combined power output of distributed 
wind farms are damped, which results in decrease in fluctuations 
of the regional power output compared to the forecast for single 
sites. This is called spatial smoothing effects. These effects provide 
the motivation for developing regional forecasting models rather 
than calculating the sum of the wind power forecast from single 
sites. Several forecasting models are discussed in Section 2.3. The 
reduction of the forecast error is found in the literature [138,139]. 


4.4, Statistical downscaling 


NWPs are primary inputs to forecasting models. So improving 
NWPs can help to improve wind speed and power forecasting. 
NWPs are only available for grid points which generally corre- 
spond to areas that are larger than the wind farms themselves. 
Therefore, estimating the wind speed at the wind farm location 
can provide more accurate wind speed forecast as an input to the 
forecasting models. The basic idea of statistical downscaling is to 
use higher resolution calculations by adding more complete 
physics descriptions to capture more precise local conditions. 
The analysis results show that wind power forecasting using 
the statistical down scaled NWPs can improve the performance 
[140-142]. Furthermore, some of the advanced NWP models are 
examined in [143-145]. 


4.5. Input parameter selection 


The selection of the input parameters for the forecasting model 
is crucial to the performance of the forecasts. Several input 
parameters are investigated in the literature for the purpose of 
reducing the forecasting error. 

In [146], the inclusion of off-site observations with NWP 
models can increase the forecast accuracy. Despite the inherent 
large forecast error from NWPs, adding off-site observations leads 
to increases in forecast accuracy. Furthermore, as the number of 
reference stations providing the measurements increase, the 
forecasting errors tend to decrease [96]. The improvements by 
the off-site observations can be the motivation for developing 
spatial correlation forecasting which forecasts wind speed and 
power at target sites by using the observations from neighboring 
measuring stations, as described in Section 2.2. 

Several meteorological parameters as inputs have been con- 
sidered for improving the performance of the forecast. The 
inclusion of wind direction with wind speed can reduce the 
forecasting error [96,116,118,126]. Sometimes the wind vector 
applying both wind speed and direction is used [45,52,69]. Besides 
the obvious importance of the wind speed as an input parameter, 
temperature and pressure can help improve forecasts [69,116,147]. 

In [148], the inclusion of the wind speed forecast at wind 
turbine hub height shows significant improvements in forecast 
accuracy. 

Furthermore, transformed data rather than wind speed itself 
show potential for improving the forecast. Several transformation 
techniques are used: wavelet decomposition [41,42,54,55,62], 
empirical mode decomposition [56,57], and frequency domain 
approach [76]. 


4.6. Power curve modeling 


When converting the wind speed to wind power production, 
inaccuracies in the non-linear relation leads to further errors. 
Wind power and wind speed are related through a cubic relation- 
ship. Thus, small variations in the wind speed result in much 
larger deviations in the wind power. The use of the certified 
manufacture power curves does not guarantee accurate conver- 
sions. Using an accurate relation between the wind speed and 
power can greatly minimize conversion errors when converting 
the wind speed forecast to the wind power forecast. 

The conventional wind power curve model is extended to a 
wind farm model by the addition of the wind direction depen- 
dence with the wind speed dependence. Such a two dimensional 
wind farm power curve model may help to provide more accurate 
relations between wind speed and power [50,140,142]. 

Statistical techniques rather than the wind power curve may be 
used to describe the non-linear relation between wind speed and 
wind power. The polynomial regression is used to estimate the 
wind power by using the explanatory variables such as wind 
speed, direction, etc. Furthermore, non-linear hyperbolic functions 
containing trigonometric terms which describe the non-linear 
behavior are used [133]. Sometimes, advanced statistical 
approaches (ANN, Fuzzy logic, etc.) are used to estimate wind 
power from the various inputs [55]. However, such a relation does 
not need to be described when the wind power is forecasted 
directly from the raw input data. However, it is required when 
forecasting the wind speed and then converting it to the power. 


4.7. Forecasting parameter 
One of the most important decisions in a wind power system 


is choosing which parameters will be used in the forecast. The 
forecaster can choose the wind speed alone or the wind speed and 
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direction or the wind vector as the forecasting parameters. Then, 
the forecasting wind information is converted to the wind power 
production. Or the forecaster can forecast the wind power produc- 
tion directly. Furthermore, the forecaster can choose regional or 
probabilistic forecasting. Choosing the appropriate forecasting 
parameters can improve the performance [149]. 


5. Conclusion and future works 
5.1. Discussion and conclusion 


Today's power system is facing growing challenges in main- 
taining a secure and reliable energy supply. Part of the growing 
challenge is the possibility of significant levels of uncertain wind 
generation being installed on power system. This brings new 
challenges involving the management of wind intermittence. 
Accurate forecasting of wind power is necessary to meeting these 
challenges. 

The main objectives of wind speed and power forecasting are 
to estimate the wind speed and power as quickly and accurately as 
possible. With more accurate forecasting integration of large 
amounts of wind power into power system can be achieved. 
Accurate forecasting tools reduce the financial risk and lead to 
improved scheduling and unit commitment plans. 

This paper provides an overview of forecasting tools that are 
available for wind generation. The wind speed and power fore- 
casting approaches reviewed have different characteristics, with 
different approaches providing better results for different forecast 
time horizons and locations. This paper divides forecasting meth- 
ods into five categories: wind speed and power forecasting, spatial 
correlation forecasting, regional forecasting, probabilistic forecast- 
ing, and offshore forecasting. An overview of existing research in 
each category is presented. 

Several wind speed and power forecasting approaches, includ- 
ing physical, statistical, and combination approaches, have been 
implemented and are now commercially available. The statistical 
approaches provide good results in the majority of cases, including 
short-term, medium-term, and long-term forecasting. However, in 
the very short-term and short-term horizon, the influence of 
atmospheric dynamics becomes more important, so that the use 
of the physical approaches becomes necessary. 

The statistical forecasting approaches, such as conventional 
statistical approaches, ANN, and ANN-Fuzzy, use large amounts of 
historical data as input. The conventional statistical approaches are 
mostly aimed at very short-term and short-term forecasting. ANN 
is able to model complex non-linear relationships and extract the 
dependence between the inputs and outputs through learning 
and training. Fuzzy logic outperforms others when dealing 
with reasoning problems, while learning and training abilities 
are mediocre. The merging of the ANN approach with the Fuzzy 
logic, creating the ANN-Fuzzy approach, provides excellent perfor- 
mance. 

It is difficult to say which model is the best because different 
models perform differently in different situations. The combina- 
tion approach attempts to leverage the strengths of different 
modeling approaches. This approach is able to not only improve 
the forecast accuracy, but also reduces the risk from extreme 
events. 

In order to select a suitable location of wind generation, it is 
necessary to assess the wind power sufficiency of the neighboring 
regions. In this case, spatial correlation forecasting may be applied 
to estimate the characteristics of the wind resource at sites where 
insufficient information is not available. Current models use 
functions that describe the wind flow depending on the distance 
between two sites by selectively using the topography of the area, 


the height of the measuring mast, the cross correlation factor, or 
other relative meteorological data. These models are more difficult 
than typical time-ahead forecasting models because they require 
measurements from many spatially correlated sites and the 
associated measurement delay times. 

Regional forecasting has become more important for market 
operators to estimate the total wind power production in a region. 
Furthermore, it has been shown that regional forecasts provide 
faster and more accurate results than single farm forecasts because 
of the spatial smoothing effects. However, few studies have looked 
into the particularities of regional forecasting. When wind energy 
penetration increases significantly solutions to this problem will 
be needed. 

Probabilistic forecasting gives an estimate of where the wind 
production will lie within a certain degree of confidence. In this 
review some of the representative methods are considered, 
including the parametric approach, non-parametric approach, 
and ensemble forecast. There are not many parametric approaches 
available due to the complex stochastic nature of wind speed. 
Thus, non-parametric approaches, and distribution-free 
approaches, have been more appealing. In addition, ensemble 
forecasting models for probabilistic forecasting are used in order 
to obtain the expected spread of weather conditions and assess the 
probability of particular weather events. 

In future, major wind power developments are expected to be 
installed offshore. More frequent and powerful wind speeds 
represent the main advantages of offshore power production. 
However, offshore wind forecasting is in an early phase of 
development. All state-of-the-art forecasting models were origin- 
ally designed for onshore. For physical forecasting approaches 
special modeling considerations have to be considered for off- 
shore. However, with statistical approaches precise knowledge of 
offshore conditions is not needed. 

Using advanced wind speed and power forecasting approaches 
introduced in Section 2, it is possible to provide significantly better 
forecasts than the simplistic or reference forecasting approaches. 
However, it is difficult to say which model is the best because wind 
speed and power forecasting is not “plug-in-play” due to the site- 
dependence. Therefore, an evaluation of accuracy relative to the 
forecasting model performance is discussed. The general forecast- 
ing accuracy is then considered along with the level of forecasting 
errors that can be expected depending on the forecast time 
horizon, location complexity, seasonal variability, and weather 
conditions. The forecasting accuracy goes down with increasing 
forecast time horizon, terrain complexity, and unstable weather 
regimes. 

Finally, several potential techniques for improving the accuracy 
of the models are discussed: (1) Kalman filtering of systematic 
errors coming from NWPs. (2) The combination of different NWPs 
or forecasting models. (3) Combining power outputs of distributed 
wind farms providing spatial smoothing effects. (4) Estimating 
the wind speed at wind farm locations by statistical down 
scaling. (5) Appropriate selection of input parameters. (6) 
Improved power curve modeling. (7) Choosing the appropriate 
forecasting parameters. 


5.2. Future work 


The reviewed wind speed and power forecasting approaches in 
this paper provide a foundation for future research. Future work to 
be considered includes the following: 


è Further improvements in both physical and statistical forecast- 
ing approaches and improved approaches for the combination 
of different forecasting approaches can be expected to further 
reduce forecasting errors. 
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è Extending the forecasts further into the future, enabling long- 
term planning. 

© Regional forecasting is useful in feasibility studies of prospec- 
tive installations and assessment of the potential impact on 
system reliability. The choice of reference sites and their 
combinations should be considered in regional forecasting. 

© Forecasting the aggregated wind power on a regional basis 
would be useful for management of total wind power produc- 
tion. It would also be useful to detect extreme events so that 
market operators can manage trading as wind power ramps up 
and down. 

© Probabilistic forecasting could provide probability distributions 
of the forecasting errors, providing the opportunity to reduce 
the reserve capacity for balancing wind power forecast errors. 

© The development of operational ensemble forecasts using the 
data from several NWPs would help improve the forecast 
accuracy. 

© Improved offshore forecasting models are needed to improve 
the offshore forecasts that are currently being done with 
models developed for onshore. 

© New approaches for complex terrains are required. These 
approaches could incorporate higher resolution maps. 

e Improving and tuning the NWP model to predict the hub- 
height wind would help improve the forecasting accuracy. It is 
sometimes required to forecast accurate pressure, temperature, 
and other related weather parameters. 

e Additional use of online wind measurement data has the 
potential for improved forecasts, especially for very short- 
term and short-term wind power forecasting. 

© Further research on automatic adaptive parameter estimation 
is required since wind speed and power forecasting is very site- 
dependent. 

è Increasing the spatial and time resolution of NWPs, better 
taking into account local phenomena, would help to improve 
the forecast. 

© Forecasting extreme events, as opposed to common behavior, 
would be very valuable to protecting the power system. 
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